Secure Multi-Party linear Regression
نویسندگان
چکیده
Increasing efficiency in hospitals is of particular importance. Studies that combine data from multiple hospitals/data holders can tremendously improve the statistical outcome and aid in identifying efficiency markers. However, combining data from multiple sources for analysis poses privacy risks. A number of protocols have been proposed in the literature to address the privacy concerns; however they do not fully deliver on either privacy or complexity. In this paper, we present a privacy preserving linear regression model for the analysis of data coming from several sources. The protocol uses a semi-trusted third party and delivers on privacy and complexity.
منابع مشابه
Poster: Secure Multi-Party Computation as a Tool for Privacy-Preserving Data Analysis
A Secure multi-party computation (MPC) protocol allows two or more parties to compute a function on sensitive input data provided by both parties, without revealing anything about the inputs (other than what can be inferred from the revealed output result). Social scientists often work with private datasets that cannot be shared due to legal restrictions and ownership issues, but many interesti...
متن کاملRegression on Distributed Databases via Secure Multi-Party Computation
We present a method for performing linear regression on the union of distributed databases that does not entail constructing an integrated database, and therefore preserves confidentiality of the individual databases. The method can be used by statistical agencies to share information from their individual databases, or to make such information available to others.
متن کاملRmind: a tool for cryptographically secure statistical analysis
Secure multi-party computation platforms are becoming more and more practical. This has paved the way for privacy-preserving statistical analysis using secure multi-party computation. Simple statistical analysis functions have been emerging here and there in literature, but no comprehensive system has been compiled. We describe and implement the most used statistical analysis functions in the p...
متن کاملSecure analysis of distributed chemical databases without data integration
We present a method for performing statistically valid linear regressions on the union of distributed chemical databases that preserves confidentiality of those databases. The method employs secure multi-party computation to share local sufficient statistics necessary to compute least squares estimators of regression coefficients, error variances and other quantities of interest. We illustrate ...
متن کاملSecure Approximation Guarantee for Cryptographically Private Empirical Risk Minimization
Privacy concern has been increasingly important in many machine learning (ML) problems. We study empirical risk minimization (ERM) problems under secure multi-party computation (MPC) frameworks. Main technical tools for MPC have been developed based on cryptography. One of limitations in current cryptographically private ML is that it is computationally intractable to evaluate non-linear functi...
متن کاملPrivacy-Preserving Distributed Linear Regression on High-Dimensional Data
We propose privacy-preserving protocols for computing linear regression models, in the setting where the training dataset is vertically distributed among several parties. Our main contribution is a hybrid multi-party computation protocol that combines Yao’s garbled circuits with tailored protocols for computing inner products. Like many machine learning tasks, building a linear regression model...
متن کامل